PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A12G1936
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 425aa    MW: 47699.6 Da    PI: 8.7257
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A12G1936genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix73.53.6e-2325113186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm...rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  +Wtk+e+laLi+a+++++ +lrrg+lk+ +W++vs+++   ++ g  +s++qC++k+e+l+kry+++k+++ k++ + ss++ ++  l+
  Gh_A12G1936  25 CWTKDETLALIDAYKDKWFALRRGNLKASDWDAVSDVVssaSDPGTVKSSVQCRHKIEKLRKRYRAEKQRSLKNSGKFSSSWDLYPLLD 113
                  8*************************************998899***************************************998876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138376.8E-2124115No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
Sequence ? help Back to Top
Protein Sequence    Length: 425 aa     Download sequence    Send to blast
MSTTTSPPLP QPSSATAARR VPPPCWTKDE TLALIDAYKD KWFALRRGNL KASDWDAVSD  60
VVSSASDPGT VKSSVQCRHK IEKLRKRYRA EKQRSLKNSG KFSSSWDLYP LLDSMNFAST  120
SVAGSDDQDH TIDHRVTVFG DFCLKSNKHE NIDGNSGSNL GFDREFRGGY NSSFNFDHKW  180
QENGGFVAKG IKKFKSDGRI GDGYGSMVDF DHSFGQHVDG LGEFPLKTLG DRSFLNVGFK  240
SKNYGCPNLN YDYDNDSKEY SIDEEMGFRA RDSGAWDSVP QGIHQKKRGR VDMNFEPGGD  300
CRGLNGDASC SRPGLERKNA GAGVKRGVDP VDEMVSSIKL LAEGFVRMEK MKMEMLKEIE  360
KMRMEMEMKH NEMILESQQK IVDAFSSALL SEKKKKKKKP SLMFSNMNGN GVGEWQEDAF  420
IKKER
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1391399EKKKKKKKP
2392398KKKKKKK
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.211610.0boll
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012439411.10.0PREDICTED: uncharacterized protein LOC105765057
TrEMBLA0A0D2T5E10.0A0A0D2T5E1_G
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM104831828
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G44730.17e-24Trihelix family protein